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Abstract. An unbinned statistical test on cluster-like de- 
viations from Poisson processes for point process data 
is introduced, presented in the context of time variabil- 
ity analysis of astrophysical sources in count rate experi- 
ments. The measure of deviation of the actually obtained 
temporal event distribution from that of a Poisson process 
is derived from the distribution of time differences between 
two consecutive events in a natural way. The differential 
character of the measure suggests this test in particular 
for the search of irregular burst-like structures in exper- 
imental data. The construction allows the application of 
the test even for very low event numbers. Furthermore, 
the test can easily be applied in the case of varying accep- 
tance of the detector as well. The simple and direct use 
of background events simultaneously acquired under the 
same conditions to account for acceptance variations is 
possible, allowing for easy application especially in earth- 
bound 7-ray experiments. Central features are the fast and 
simple calculation of its measure, and the existence of an 
analytical approximation that describes the general test 
statistics to a high degree of precision. 



astronomy, this test deals with discrete registration times 
of single events, i. e. point process data. This situation is 
typically encountered in 7-ray astronomy. 

It is dedicated to two basic, different, but closely re- 
lated questions that require statistical methods: 

1. Are there significant signals of variability from a cer- 
tain celestial position even if there is only a moderate 
or hardly significant DC excess from that direction, if 
the observation is limited by background fluctuations? 

2. How well is the sequence of arrival times of pure (or al- 
most pure) events from a well-known source described 
by a constant flux? 

Considering the case of known or assumed periodicities 
of activity of the objects in question, several statistical 
tests are established to search for fixed frequencies in point 



process data (see e.g. Mardia 1972, Lewis 1994). 

For the remaining case of irregular temporal activ- 
ity schemes, the situation concerning established tests is 
less satisfying. In many publications, more or less spon- 
taneously invented measures of variability are used, based 
mostly on a procedure of binning the event data in equally 
Key words: methods: data analysis - methods: statistical sized time slices and subsequent application of a x 2 test. 

Based on this idea, some rather elaborated tests have 

been worked out, binning the data into a lot of different 
ways under variation of bin phases and widths (e. g. Col- 
lura etal. 1987, Biller etal. 1994). The measure of vari- 



1. Introduction 

In the search for high energy astrophysical 7-ray sources, 
be it with earth-bound experiments or with satellite borne 
detectors, the test on an integral (DC) excess of event 
numbers from the direction of a source, compared to an 
adequately derived background expectation, is certainly 
the major statistical tool. (Fo r a q uite general considera- 
tion of this topic, see Li&Ma 1983|) . However, an integral 
excess of events is not the only characteristic one might 
be interested in. Many 7-ray sources are known to exhibit 
highly variable fluxes. 

In this paper a newly developed test on variability for 
photon counting experiments is presented. In contrast to 
time series analyses, in which sampled continuous func- 
tions are examined and which are applied e. g. in optical 
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ability is then obtained by a certain combination of all 
single x 2 test results. Such tests typically suffer from two 
deficiencies: 

1. The chance probability for a given probe can in general 
only be assessed by M t0 Carlo means, since the single 
X 2 test results are not independent 

2. The computation of the measure itself is quite expen- 
sive expressed in terms of computing time and storage 
needs since the data has to be divided into a lot of 
different binning schemes 

Other drawbacks in certain situations are the restriction of 
those tests to sufficiently high event statistics, and the fact 
that the rigorous employment of simultaneously acquired 
background events to account for a non-uniform temporal 
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acceptance of a detector is not obvious. In addition, the 
procedures are by far too complicated to apply them to a 
huge number of different potential source directions, e. g. 
using them for a kind of all-sky search for variable sources. 

It should not be concealed that there exist alternative 
variability analysis procedures for point process data (see 
e. g. Scargle 1998 or Gregory & Loredo 1996 for an applica- 



tion of Bayesian methods). These are surely appropriate 
to attack "higher order questions" such as typical time 
scales of variability or change points of fluxes, but the ap- 
plication is at least as complicated as the use of binned 
tests. 

These difficulties may well bring down the willingness 
of experimenters to apply those tests. The unbinned exp- 
test that will be introduced in the following is, in con- 
trast to the tests mentioned above, easy to apply, using 
a straightforward, natural measure for variability. The re- 
sult of it will therefore not be as specific as for most of 
the above tests, but will supply a measure for variability 
in general. 

It should be emphasized that one established test 
of this kind is already existing: the nonparametric Kol- 
mogorov test, applied to the cumulated distribution func- 
tion (cdf) of the registration times of the events in ques- 
tion, comparing them to the expected equal distribution. 
In opposition to it, the exp-test will have a more differen- 
tial character, thus complementing the Kolmogorov test 
in a sense that will be specified later. 

Although described here in the context of time vari- 
ability analysis, the application of the exp-test is not re- 
stricted to the time domain, but may be used in the a- 
nalysis of e. g. spatial or frequency data as well. It might 
also be found useful in other disciplines of science where 
the analysis of empirical data plays a role, like biology, 
economics and engineering. 

In the following section, the exp-test is developed for 
the ideal situation of a uniform or exactly known temporal 
acceptance function of the experiment with respect to the 
events from a potential source. Section 3 deals with the 
generalization needed to apply the exp-test with simulta- 
neously registered background events, while in section 4 
the sensitivity of the test is characterized. In section 5, a 
brief comparison with the Kolmogorov test is carried out, 
and in the last section a summary is given. 

2. Developing the exp-test 

Supposed a uniform temporal acceptance of the used de- 
tector is given, the expected temporal sequence of regis- 
tered events in absence of a variable source is governed by 
Poisson statistics. This applies to both background events 
and events from a steady source, thereby defining the zero 
hypothesis for any test on variability in this context. 
One starts with the Poisson distribution 



Px(n) 



A™ 



where A denotes the expectation value. 
A monotone sequence (U) of events in time t is called a 
Poisson process, if there exists a constant C, such that 
for all At > 0, dividing the time in equally sized intervals 
of the length At, the numbers of events per time inter- 
val are Poisson distributed with A = At/C and mutually 



independent (see e.g. Chatfield 1994). From this immedi- 
ately follows the probability density function (pdf ) of the 
time intervals At between two consecutive events as the 
derivative of P\(0) with respect to At 



/c(At) 



P(At+dAt)/c(ty - P(At)/c(0) 

dAt 
<iexp(-At/C) 



dAt 



-•exp 



At 



(2) 



i. e. an exponential distribution in At with the mean 
value C. 

Let there now be a randomly chosen time interval, 
containing the monotonically increasing sequence of event 
times (Tj)i=i...jv+i- Let the resulting distribution 



{AT;} 



i=l...N 



■= {{Ti+i — Ti)}i—i. 



.N 



(3) 



have a mean value AT =: C*. If one sets C = C* in Eq.|| 
(i.e. setting C to the actually obtained mean value), it 
follows for the ATj to stem from a distribution fc*(At), 
under the constraint of the conserved mean value. This 
is generally valid, even for a priori unlikely values of C* , 
because the Tj are completely uncorrelated. This ensures 
that any analysis based only on C* is independent of a 
possibly present DC excess or deficit. 

Now the distribution of the AT^ can be compared with 
fc(At) using a distribution test. The classical tests (e.g. 
Kolmogorov test or Smirnov-Cramer-von-Mises test) are 
neither well suited to measure exactly the effect that is 
searched for, nor is it possible to take the constraint of 
the conserved mean value into account. For this reason a 
natural measure will be introduced here that is especially 
sensitive to excesses of the ATj far from the mean value. 

Defining the pdf F(At) of the random probe 



1 N 



(4) 



i=l 



(1) 



the equalities of the 0. and 1. momenta of fc* and F serve 
as a starting point: 

oo oo 

J fc {At) dAt = jF{At)dAt = 1 (5) 



oo oo 

J At- fc (At) dAt = J At ■ F(At) dAt = C* (6) 
o o 

The variance of the distribution F(At) could be used as a 
measure for event clusters (and corresponding dilutions), 
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but this quantity is too sensitive on very large AX^ due to 
the extreme assymmetry of the exponential distribution. 
(Actually the value of the variance is dominated by the 
pdf between C* and oo, although only 37% of the ATi 
are contained in that interval.) In order to be not too 
sensitive to outliers, another way will be followed. From 
Eqs.|[^ follows 

oo 

/ (l-^)./<7.(A*)dAt 
" := h(At) ' 

OO 

= J (l - ^\ ■ F(At) dAt = 



(7) 



Defining now 



X 

U{x) := J h(At) - fa* (At) dAt 



(8) 



it can be concluded 

- H(0) = H(oo) = 

— H(C*) = 1/e ist the global maximum of H, as fc* 
is positive everywhere, and h(At) is monotonically 
falling, being zero at C* 

Replacing in Eq.|| fc» by F, the first property holds, and 
the global maximum is again found at C* , but varying 
around a mean value near 1/e. (The transition from C, 
i.e. the expected value, to C*, the actual mean value of 
the random probe, slightly reduces this value .[] This fact 
will be investigated later.) Hence one has 



c 

M(F) := /(i-^) -F(At)dAt 
o 

(with C* — j AtF(At) dAt) 



(9) 



being a functional on the set of all pdf defined on [0, oof, 
with the property M(/c) = 1/e. 

Provided C* is already computed, M can be calculated 
from the pdf of the ATi between and C* alone. It will 
be higher than expected if untypical excesses of small ATi 
are present (i. e. in the case of burst-like phenomena), and 
lower than expected for untypical excesses of ATi near C* 
(i. e. for untypically regular temporal structures compared 
to the average Poisson process). 

Fixing the extreme cases 

1. the heartbeat function 
V(At) :=S(At-C*) 

belonging to the case of all ATi being equal, leading 
to M(<?) = 0, and 



1 Random probe mean values are known to be opportunist. 
By definition they are minimizing the variance, and therefore 
do also decrease the measure of spread defined here. 



2. the needle peak function 

*(At) := lim e ^o (1 - e) • S(At) + e ■ S(At - C*/e)) 
characterizing the case of almost all ATi — 0, and the 
rest (e. g. one of them) being huge, resulting in the 
mean value of C* , which leads to M(^) — > 1. 

it can be noted for any pdf on [0, oof with a mean value 
ofC*: 



M(F) € [0,1[ 



(10) 



From the above it can be concluded that M defines a 
well-behaved and natural measure of variability, therefore 
M is chosen to be the measure of the exp-test. It should be 
stressed that only the spectrum of time intervals between 
two consecutive events is entering, thus giving this test a 
rather differential character. 

It remains the derivation of the actual distribution 
function for the M(F) for given N (in the following de- 
noted distribution) under the zero hypothesis. In any 
case, this pdf is independent on the particular value of 
C* , since this constant only scales the time, whereas M 
is a dimensionless quantity. Analytically exact solutions 
are surely possible but not attempted here, because a 
considerable expenditure has to be expected. Instead, a 
semi-analytical approach is followed, using the fact that 
all spreads occuring here are resulting from spreads in the 
polynomial distribution, all of them scaling with y/N, thus 
leading for dimensionless variables such as M to asymp- 
totic standard deviations a oc 1/y/N. For a similar reason, 
the difference of the mean value \x and 1/e will be, again 
at least asymtotically, proportional to 1/N. Furthermore 
it can be expected, according to the central limit theorem 
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Number N of At intervals in the random probe 

Fig. 1. The standard deviations cr(Mjv) • y/N and the 
shift of the mean value (1/e — /i(Mat)) • N of the Mat 
distribution from M te Carlo calculations. Shown are also 
the weighted mean values (straight lines). The errors for 
the spread values (circles) are indicated by the size of the 
central spot. Sample sizes are 1.3- 10 5 for each tested value 
of N (except for N — 10 5 with a sample size of 1.3 • 10 4 ). 
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Fig. 2. Comparison of the Mjv distribution with the normal distribution N(l/e — a/N, (3/VN): shown are the chance 
probabilities from the normal distribution (solid lines) and the relative frequencies from M to Carlo calculations for Mat 
(symbols and dashed lines) against a. Each sample distribution consists of 1.3 • 10 5 events. 



of statistics, that for increasing N the distribution as a 
whole will tend towards a normal distribution. Therefore 
only two constants a, (3 need to be determined, for which 
the following asymptotic scaling laws apply: 

- and ^(M iv ) = -= 



m(m 



N) 



(11) 



In simulations of Poisson processes by generating time se- 
quences with the help of a pseudo random number gener- 
ator, there could not be found any significant deviations 
from Eq.[fl] for N ranging between 20 and 10 5 (using 
1.3 • 10 5 individually generated sequences per considered 
number of N, see Fig.|). With weighted means of the 
obtained values, a, (3 could be constrained to 



a = 0.189 ±0.004 



f3 = 0.2427 ±0.0002 



(12) 



There is only one faint (but significant) deviation from 
the y/~N scaling law of the spread at N = 10 (the actu- 
ally obtained number there is a ■ \/l0 = 0.2400 ± 0.0005). 
However, the use of the scaling law at N = 10 thus corre- 
sponds to an error of ~ 1% on a significance scale, which 
is negligible in almost all cases. 

After having determined the mean values and the 
standard deviations with sufficiently high precision for 
most purposes, only the shape of the M^v distribution 
has to be compared to that of a normal distribution 
N{\/e — a/N,(3/^/N) with the same mean and variance. 
In Fig. U the generated relative event frequencies are com- 
pared to N(l/e — a/N, f3/\/N) -distributed probabilities, 
for examples oi N = 10, 100, 1000. Due to the fact that 
/i(Mjv) ~ 1/e is not in the middle of the allowed interval 
([0, 1[) for Mjv, a faint positive skewness of the M^v distri- 
bution can be noticed for smaller N. The peculiar drop 
of frequencies towards larger negative significances for 
N = 10 results from this, too, since M 10 = corresponds 



to ~ — 4.5 a. Such an effect is already hardly noticeable for 
N = 20. One notes that already for N — 10 and moder- 
ate positive deviations from the expectation value (<2<r) 
the actual Mjv distribution follows the normal distribution 
with a good precision, whereas for N = 100 the approxi- 
mation with the normal distribution is globally valid for 
most practical cases. So only for extremely high precision 
requirements individual M te Carlo calculations are neces- 
sary to assess the exact significance of an effect. 

3. Using background events for the exp-test 

The exp-test as developed so far can be readily applied 
in data analyses of experiments that have a sufficiently 
constant temporal acceptance. In the case of a varying ac- 
ceptance function Eq.^ is not valid, as one cannot expect 
a Poisson process any longer. (This case is sometimes re- 
ferred to as inhomogenous Poisson process, see Chatficld 



1994.) 



If the temporal acceptance is discretely switched on 
and off, the problem can be easily solved by excluding the 
time intervals during which the data aquisition was actu- 
ally switched off or blocked (e. g. due to electronics dead 
time). In the general case (continuously varying accep- 
tance a(t), which is the typical case in earth-bound 7-ray 
experiments) , the variable t has to be replaced by a scaled 
effective time r (a(t)-dt — > cLt), in order to apply the exp- 
test as defined above, but this would require a precise and 
complete knowledge of the temporal acceptance function. 
However, a simple reasoning shows that background event 
times (t]3G ,j ) (being from a Poisson process in r as well), 
simultaneously collected with the event times (U) of inter- 
est, could serve the same purpose if the background has a 
sufficiently high temporal density (e. g., in the case of cos- 
mic rays, if the background events stem from a solid angle 
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which is large compared to the one that defines the "on- 
source" events). In this case no explicit knowledge about 
a(t) is required any longer. Instead, one can use the dis- 
tribution of the ANbg between two consecutive on-source 
events directly, thus testing, so to speak, whether the two 
distributions are mutually Poisson distributed. 

Proceeding for this purpose from the pdf of the gener- 
alized time intervals At (see Eq.^) between two on-source 
events (the actual acceptance function a(t) does not play 
a role in the end; the only exploited feature is that it scales 
the on-source and the background distributions simulta- 
neously) : 



1 (At 

/ on (At) = — exp - — 

^on \ ^on 



(13) 



with a certain, fixed C on . 

Now let the background events be from a Poisson process 
as well, with an expectation value Abg m a generalized 
time interval At of 

Abg(At) = At/Cbg 

From Eq. [l3| follows the distribution of the mean values for 
the number of background events between two on-source 
events: 

I i Abg 



/(A 



BG ) 



c 



■ exp 



C 



(14) 



(with a global expectation value of C :— C oi1 /Cbg) 
The complete pdf for the number of background events be- 
tween two consecutive on-source events (in the following 
called "Inter-Events") can be derived now as a weighted 
(with /(Abg)) integration of the Poisson distribution 
over A B q: 



Pa bg (see Eq 



wc{n) 



p A BG (n) ■ /(A B c)dA 



BG 



A BG =0 
1 



C+l \C+1 



c 



(15) 



i. e. a discrete exponential distribution. Expressed with the 
help of the exponential function one yields 



wc(n) 



C 



TT cxp {- log ( 1 + h 



(16) 



(compare Eq. ^|). Thus the number of background events 
can be used as a substitute for a generalized clock (mea- 
suring r intervals). The precision of this "clock" will be 
defined by C (i. e. the ratio of the number of background 
events to the number of time intervals to test, or, equiva- 
lent, the mean value of all Inter-Events). 

Analogously to the definitions in the last section let 
there be a frequency distribution W(n) of Inter- Events 
from a random probe of N intervals, with a mean value of 



ra=0 



n ■ W(n) = C* 



(17) 
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Fig. 3. The dependence of (3c* and etc* from C* from 
M te Carlo calculations (see text). The solid line (left) 
shows the least squares fit (repeated with dashes at the 
right hand side). The sample sizes for each considered C* 
are 1.8 • 10 5 . 



K 



(18) 



Eqs.|5|, ^| are valid analogously, and one writes 
with H(-l) =H(oo) =0 

with the maximum found at K = [C].Q The value at [C] 
depends obviously on C itself, but tends again to 1/e if 
C -> oo. With 



[C* 



n 



W{n) ( C* from Eq. 17) (19) 



again M(W) € [0, 1[, and for W(n) from Inter- Events in 
the case of a Poisson process, that M(W) has a mean 
value near 



M (C* 



\C* 



1 



c* 



1 



c* 



C* + l 



[C* 



(20) 



and a variance that depends this time also on C* . 

Naming the mean value resp. the standard deviation 
of the corresponding M.c*,n distribution /j,c*(N) and 
er<7» (N) , respectively, and defining (analogously to Eq. [ll]) 



M (C*) - Mc-(iV) 



N 



(21) 



then empirically the following parametrization can be 
found: 



ac* ~ ct ■ k x 
with ki = 1.67 ± 0.02 



(3 C * w ■ kf +k2 (22) 
k 2 = 0.37 ±0.03 



(a, (3 from Eq.JL2|) 

The least squares fit that leads to k\ and fea is shown 
in Fig.^ (left). Since the spreads can by far more accu- 
rately be assessed than the mean values, again the fit 
to them provides the more precise numbers. However, 

2 Here and in the following the pair of square brackets [. . .] 
denotes the integer function. 
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in Fig. | (right) it can be seen that the dependency of 
ac* I ol on C* is compatible with the same parametriza- 
tion, so that the same functional behaviour may well 
be assumed. Summarizing, it can be stated for the dis- 
crete case that was considered here, that the Mc*,n 
distribution equals approximately a normal distribution 
N(Mo(C*) - a C */N,/3 c */VN), with only small devia- 
tions similar to the ones shown in Fig. || For C* — > oo the 
distribution of A4c*.n tends in all aspects towards the 
M^v distribution. Since decreasing C* lead to increasing 
spreads /3q* , for optimization of the sensitivity C* should 
be chosen as large as possible. 

As a last general remark that applies to the continuous 
case as well as to the discrete case it should be stressed 
that the measure of the exp-test M(F) or A4(W) can be 
assessed with running sums when one allows for two se- 
quential passes over the data: in the first pass, C* can be 
determined, whereas in the second pass the sum in Eq. || 
or Eq.|l^ can be calculated. This possibility is a major 
advantage in comparison to binned procedures. 

4. Sensitivity of the exp-test 

This section deals with an application of the exp-test in a 
general scenario, in order to characterize its sensitivity. 

Consider the case of a sporadically active source with a 
duty cycle q e]0, 1], during which it "pollutes" the undis- 
turbed background with additional events from another 
Poisson process (i.e., the constellation of the first ques- 
tion mentioned in the introduction). Let the total num- 
ber of registered events be N, the number of background 
events be Nt, and the number of additional events from 
the source be N 2 (i. e. N = Ni+N 2 ). Without loss of gen- 
erality the total observation time is set to 1 (see sketch in 
Fig.|j). The same pattern applies to the case of undiluted 
source events in a search for a signal of variablility (second 
question posed in the introduction), in which case Ni de- 
notes the quiescent flux 4>i ow and N 2 jq + Ni corresponds 
to the flux $hi g h during the active state of an object. 

To begin with the consideration of the continuous Mjv 
statistics, it follows for the pdf F of At to be 

F(At) = -^(1 - q) ■ f Cl (At) + 2+ N q ■ f C2 (At) (23) 



with C\ 



N 



C 2 = 



qN 



Ni N 2 + qNi 

and f Ci ( At) from Eq. 2 

(i.e., according to Eq. [| C* — 1, without loss of general- 

ity) 

From Eq.|] it follows an expectation value for M(F) of 



(M(F)) = -exp(^ 



1 — q + q exp — 



qN 



(24) 



It is important to note that, due to the differential 
nature of the test, F(At) and consequently (M(F)) 



high 




Fig. 4. Sketch of the mean event rates from a source with 
a variable flux (two flux levels) . 



apply not only to a flux pattern as shown in Fig. |], but 
to all two-leveled light curves. More specifically they 
are neither affected by the actual position of the time 
window of the active state, nor by an arbitrary number 
of interrupts of the active state, as long as the total 
duty cycle is kept fixed and N 2 is large compared to the 
number of interrupts. 

In the notion of the 1. question, one will ask for the 
significance of the exp-test result in comparison to the 
significance of a DC excess. For the exp-test, a significance 



-'exp 



N((M(F)) - l/e)/0 



(25) 



can be expected. For any fixed N 2 /N, (S cxp ) is a posi- 
tive, strictly monotone convex falling function of q, being 
zero at q = 1. In the limit of an arbitrary but fixed ratio 
< 1 one gets from Eqs. |J,[25 



(S'exp) — 



e(3 



N 

'-w 2 q 



1 — exp — 



A2 
qN 



N 9 



(26) 



Under the above limit the expectation for the DC sig- 
nificance Sbc m the case of a perfect knowledge of the 
background level is 



(27) 



(Sdc) - N 2 /VN 
Thus 



("Sexp) — 
1 

ej3 



—„ ■ 1 



N , 



exp 



A2 
qN 



■ (Sue) (28) 



1.51 



3 In this consideration of expectation values there is no in- 
fluence from the shift of the mean value a/N. In any case of 
real random probes with finite N, this shift cancels out since 
it concerns the actual probe as well as the expectation. 
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i. e. in the limit q — ► (that means extremely burst-like 
behaviour of the source) one has to expect a significance 
from the exp-test that is 50% higher than the DC result, so 
it can be stated that it is worth to apply the exp-test in the 
case when some DC excess is present from an appropriate 
candidate source. It should be emphasized once again that 
both significances are perfectly independent if the zero 
hypothesis is true. 

Obeying the limit from above and defining q cr n ■— 
Nz/N, for q <C f/crit h follows from Eq. p8| that 



(S cxp ) - 
while for q = 

(>Sexp) 



1 - — ) • (3dc; 

<?crit , 



g cr it one obtains 
1 



(29) 



9=9crit 



-(S e 



q=0 



i. e. in the case of moderate DC significances, only for 
Q ^ 9crit a significance from the exp-test that is worth 
mentioning can be expected. In the context of the first 
question, the exp-test therefore should be regarded as a 
test on burst-like temporal structures. 

In the case of the discrete M.c*,n statistics, qualita- 
tively the same signature for the considered scenario oc- 
curs. One finds for the probability distribution 



Nx 



W(n) = -^(l- ? ).«, 0l (n) 

with Ci = C* ■ — , C 2 =C 
and wci(n) from Eq. 15 



N 2 + qNi 



N 



w C2 {n) (31) 



qN 



N 2 + qNx 



For arbitrary C* it is not possible to give a closed ex- 
pression for (jM(W)) analogously to Eq.^J. However, the 
main feature of the discrete case can be assessed as follows. 
Using this time 



N({M(W))-Mo(C*))/p c * 



(32) 



one finds after lengthy but trivial calculations for integral 
C* and q < N 2 /N < 1: 

C' + l 



(S, 



cxp/ 



c* 



1 



1 



— 

<7crit / 



(5dc) (33) 



C* + l 

Under the limitations stated above this yields for C 
(S cxp ) « — 
w 0.7 

but for C* = 10 already 



1 



- 

<7crit / 



(S 



DC 



(34) 



(S, 



cxp/ 



10 



11 



1 



11/ PlO V ?crit/ 



(35) 



1.4 



Please note in this context, that for the calculation of the 
DC significances a perfect knowledge of the background 
level was still assumed. 
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(30) Fig. 5. Contour lines of the exp-test significance S cxp in 
the q-r plane. This figure allows for the determination of 
the detectable q-r region if the total number of events TV 
and a required minimum value of S cxp are given. 



The overall shape of the function (S cxp ) / (Sue) m 
dependence of q is, apart from the lower absolute values, 
very similar to Eq.|2^. It therefore can be summarized 
that the use of background events qualitatively yields the 
same results as the continuous statistics, while the 
quantitative loss in sensitivity in dependence of C* can 
be obtained from a comparison of Eq. ^9] and Eq. [33|. 

Turning now to the second question mentioned in the 
introduction (pure source events), one will be interested 
in the sensitivity to detect a certain flux ratio < I ) high/ < I ) iow 
when a duty cycle q is given, or vice versa. This shall only 
be carried out here for the continuous Mjv statistics. 

Defining r :— $hi g h/^iow, one gets from Eq.[2^: 



(Se 



N 



(<M(F)> 



1 



N 
e/3~ 



1 — q + q ■ exp — 



1 



• exp 



q+ (1- q)r 

q-q-r 
q + (i -q)r 



(36) 



In Fig. |^ the resulting expectations for the significances 
are displayed as contour lines in the q-r plane. Making 
use of the fact that the the assumed two-level model is 
most efficient to produce an effect on M(F) with respect 
to an upper limit of the ratio of maximum to minimum flux 
levels present for a given source, for a sufficiently high sig- 
nificance ^exp it is immediately possible to deduce a lower 
limit of actual flux variations 'I'max/'I'mm for a source. For 
the given example values of S'exp, these ratios can be read 
off from Fig. |^ as the ordinate values of the cusps of the 
displayed curves. For any given value of S oxp , $ m ax/'I > min 
has to be obtained with graphical or numerical methods, 
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regarding Eq.|36| as an implicit function r(q) with S oxp as 
a parameter. 

It can be concluded that in this case, where the sig- 
nificance is expressed in terms of flux levels in the high 
and the low states rather than in terms of event numbers, 
the exp-test is a quite broad-banded tool in the search for 
variabilities with respect to the duty cycle q. 

5. Exp-test versus Kolmogorov test: a comparison 

In order to recognize the strengthes of the exp-test on the 
one hand and the Kolmogorov test on the other, the latter 
shall be briefly reviewed here. 

The Kolmogorov test uses the maximum D of the ab- 
solute differences of the cdf G(t) of the temporal distri- 
bution of the event sequence and the expected cdf g(t) 
(i. e. an equal distribution in the case of uniform temporal 
acceptance): 



D := max\G(t) - g(t)\ 



(37) 



Being TV the total number of events, it can be shown that 
the chance probabilities Pkoimo hi dependence from D un- 
der the zero hypothesis arc asymptotically distributed ac- 
cording to Kolmogorovs A distribution: 



P D > 



A 

N 



2 ■ 



E 



(-ir 



fe+i 



-2fc 2 A 



(38) 



(Kolmogorov 1933). 

Consider now a distribution on [0, 1] similar to the one 
shown in Fig.^, with one uninterrupted activity phase of 
the length q, containing N% excess events, but this time 
with a specified starting time to € [0, 1 — q]. The corre- 
sponding cdf is sketched in Fig. ||, and the expected differ- 
ence (D) to the equal distribution can easily be calculated: 



„ iV 2 



(39) 



(40) 



{Dq,t ) = (1 - Q - min(i , 1 - 
Averaging over to yields 
{ £) ) = i (1 _ q ) . *Z 

Again the question of the significanc^ one may expect 
from this constellation shall be posed. An easy estimate 
can be obtained under the assumption that D already 
determines the actual D to a sufficient precision in the 
test, that means, a difference D > D is fairly improbable 
under the zero hypothesis. Under this condition, in the 
series in Eq. [58] all but the first term are negligible, and it 
follows 



Pi 



kolmo 



D > D, 



Hi-?) 2 



N 



(41) 



4 Calculating significances for results of the Kolmogorov 
test is rather unusual, but shall be performed here 



easier comparison. With freq(x) := 
-z 2 /2) dz every chance probability Pq cor- 



for the sake of 
(V^)" 1 ./: oo exp(- 
responds to a significance of S = — freq 1 (Pc)- 




duty cycle q 



Fig. 6. Sketch of the cdf G(t) of event times from a source 
being in an active state during [to, to + q] , and the cdf g(t) 
of the zero hypothesis (see text). 



From 



1 1 _1 2 1 
• - • e 2* = _ — 

/2tt x V2tt 



= 11 + 01-2 

. X 



e 2 dz 



e s z dz 



(42) 



Po(x) 



a first order asymptotic expansion of the standard normal 
distribution can be deduced: 



P(S) 



1 



/2vr 



1 

S 



-is 2 



(43) 



from which one obtains in a comparison with Eq.^l] for 
the significance Skoimo of the Kolmogorov test result 



(Skolmo) » ^(1 - Q)~7= w \( l ~ l)( S dc) 



(44) 



Numerical calculations show that this estimation is indeed 
approached for large N and (S^c), while for small TV and 
(Sdc) (Skoimo) is somewhat smaller (Prahl 1999| ). For an 
example of N = 100 and (Sdc) — 3<t the significance 
(Skoimo) is reduced by w 20% for q — > 0. Nonetheless, 
Eq.|4^ may serve as a rough estimate for all practical pur- 
poses. 

It is obvious that for splitted activity intervals (Skoimo) 
will be much lower. In a comparison with the exp-test (see 
Eq.|28|) therefore it has to be stated that for one single 
outburst of a source with q N2/N the expected signif- 
icances from the Kolmogorov test and from the exp-test 
are similar (but slightly higher from the latter), while for 
a comparably long and uninterrupted activity interval the 
Kolmogorov test is more sensitive, whereas the exp-test is 
better suited to find some or many short outbursts. 

Finally, the correlation of both tests under the zero 
hypothesis has to be studied. This is again performed in 
M tc Carlo calculations. It turns out that the correlation 
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Fig. 7. The correlation coefficient between the signifi- 
cance result of the exp-test and that of the Kolmogorov 
test for M te Carlo generated Poisson processes, versus the 
number N of tested events. The sample size for each con- 
sidered value of N is 1 • 10 5 . 



coefficient g between S'koimo an d <S cxp is a monotonically 
decreasing function of N, being g ss 0.56 for N = 10, but 
already only g 0.23 for N = 100. The graph obtained 
for g(N) is shown in Fig.^. One concludes that it is worth 
to apply both tests in a search of variable sources, maybe 
except for very small event numbers. 

It should be mentioned that, like for the exp-test, the 
result of the Kolmogorov test is completely independent 
from the DC test result if the zero hypothesis is valid. Fur- 
thermore, the application of Smirnovs variant (Smirnov 
1939| ) allows for the use of simultaneously acquired back- 



ground events to define the zero hypothesis in the case 
of nonuniform temporal acceptance. Just like for the exp- 
test, the Kolmogorov test result can also be determined 
from runnig sums during two sequential passes over the 
experimental data. 

6. Summary 

It has been shown that, given N time intervals 
{ATfe}fc = i. jv between each pair of consecutive event times 
of the sequence to test (resp. N Inter-Events {nk}k=i...N 
in the discrete case), the quantity 

M = £ E 

T k <C* 



resp. A4 



E 

n h <C* 



C 



is asymptotically normal distributed according to 
N(l/e - a/N,f3/VW) (resp. according to N(M {C*) - 
ac /N, 0c* /VN)) if the temporal sequence represents 
a Pois son" process (resp. mhomogcnous^Poisson process). 
The distribution of the time intervals F(Ar) resp. of the 
Inter-Events W{n) is easily accessible, and M(F) resp. 
Ai(W) are quickly calculable from it. In contrast to binned 



tests, the measure of the exp-test can be obtained from 
running sums when processing the data. The determina- 
tion of the significance of the exp-test 5 eX p therefore is 
almost as easy as the calculation of a DC significance. 

In observations that are limited by background fluc- 
tuations, obtained positive significances S CX p for a tested 
temporal sequence correspond to small chance probabil- 
ities and can directly be interpreted as significances for 
burst-like behaviour. For sufficiently small duty cycles, the 
significances from the exp-test are expected to be 1.5 times 
higher than the DC significances. The direct use of back- 
ground events acquired simultaneously matches perfectly 
the requirements of typical experiments in the earthbound 
7-ray astronomy. 

Asking for signals of variability in pure source events, 
the result of the exp-test represents a simple and broad- 
banded measure for deviations from a constant flux. A 
positive result of the exp-test in such situations supplies an 
immediate information about the minimum flux variations 
'I'max/'&min present in the sample. 

The results from the exp-test are by construction in- 
dependent from a DC excess or deficit, and unaffected by 
interrupts of the activity cycle as far as possible. The test 
can be applied as soon as the number N of events to test 
surmounts w 10. 

Although every test on variability contains arbitrary 
elements, special care was taken to find a nonartificial 
measure that is suggested by itself when studying the ele- 
mentary statistics of Poisson processes. From the view- 
point of the author, the exp-test fulfills this criterion. 
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